Optimal categorization
Abstract
The importance of categorical reasoning in human cognition is well established in psychology and cognitive science, and one of the most important functions of categorization is to facilitate prediction. This paper provides a model of optimal categorization. At the beginning of each period, a subject observes a two-dimensional object in one dimension and wants to predict the object's value in the other dimension. The subject partitions the space of objects into categories. She has a database of objects that were observed in both dimensions in the past. The subject determines which category the new object belongs to on the basis of its observed first dimension. She then predicts that its value in the second dimension will equal the average value among the past observations in the corresponding category. At the end of each period, the second dimension is observed and the observation is stored in the database. The main result is that the optimal number of categories is determined by a trade-off between (a) decreasing the size of categories in order to enhance category homogeneity, and (b) increasing the size of categories in order to enhance category sample size.
Keywords: Categorization; Priors; Prediction; Similarity-Based Reasoning.
JEL codes: C72.
Thanks to Jörgen Weibull for helpful advice. The paper has also benefited from comments by Philippe Jehiel, Topi Miettinen and Robert Östling, as well as participants at the Third Nordic Workshop in Behavioral and Experimental Economics in Copenhagen, November 2008, and SUDSWEc in Uppsala, May 2009. Financial support from the Jan Wallander and Tom Hedelius Foundation is gratefully acknowledged.
E-mail: [email protected]. Mail: Department of Economics, Stockholm School of Economics, P.O. Box 6501, SE-113 83 Stockholm, Sweden.
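The prediction rule described in the abstract can be illustrated with a small simulation. This is only a sketch under assumptions that do not come from the paper: objects have a first dimension drawn uniformly from [0, 1], the second dimension is the first plus Gaussian noise, categories are k equal-width intervals, and an empty category falls back to the overall average. The names `category_index`, `predict` and `mean_squared_error` are hypothetical.

```python
import random


def category_index(x, k):
    """Index of the equal-width category of [0, 1] that contains x."""
    return min(int(x * k), k - 1)


def predict(x, database, k):
    """Predict the second dimension of a new object with first dimension x.

    Uses the average y among past observations in x's category. If that
    category is empty, falls back to the overall average (an assumption
    made for this sketch, not taken from the paper).
    """
    cell = category_index(x, k)
    in_cell = [y for (x_past, y) in database if category_index(x_past, k) == cell]
    values = in_cell if in_cell else [y for (_, y) in database]
    return sum(values) / len(values)


def mean_squared_error(k, n_past=30, n_trials=2000, noise=0.3, seed=0):
    """Monte Carlo estimate of squared prediction error with k categories."""
    rng = random.Random(seed)

    def draw():
        # Assumed data-generating process: y = x + Gaussian noise.
        x = rng.random()
        return x, x + rng.gauss(0.0, noise)

    total = 0.0
    for _ in range(n_trials):
        database = [draw() for _ in range(n_past)]  # past observations
        x_new, y_new = draw()                       # the new object
        total += (predict(x_new, database, k) - y_new) ** 2
    return total / n_trials


if __name__ == "__main__":
    # Compare one coarse category, a few categories, and many fine ones.
    for k in (1, 4, 30):
        print(k, round(mean_squared_error(k), 3))
```

Under these assumptions, a single category pools heterogeneous objects, while very many categories leave each one with few or no past observations, so an intermediate number of categories would be expected to predict best. The specific values depend entirely on the assumed noise level and database size.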
Similar resources
Support Vector Machine Parameter Optimization for Text Categorization Problems
This paper analyzes the influence of different parameters of Support Vector Machine (SVM) on text categorization performance. The research is carried out on different text collections and different subject headings (up to 1168 items). We show that parameter optimization can essentially increase text categorization performance. An estimation of range for searching optimal parameter is given. We ...
Psychological Theories of Categorization as Probabilistic Graphical Models
One natural representation of a category C is as a probability distribution (density) over the observed features. In this perspective, optimal categorization amounts to calculating the probability of that distribution given some novel observation. This paper focuses on probability distributions that can be represented using probabilistic graphical models, principally Bayesian networks and Marko...
Finding Optimal Combination of Kernels using Genetic Programming
In Computer Vision, problem of identifying or classifying the objects present in an image is called Object Categorization. It is a challenging problem, especially when the images have clutter background, occlusions or different lighting conditions. Many vision features have been proposed which aid object categorization even in such adverse conditions. Past research has shown that, employing mul...
Easy categorization of large image collections by automatic analysis and information visualization
A large part of our history as well as our daily lives is captured in visual data. Understanding visual collections requires careful categorization to reveal expected as well as hidden relations. Performing this categorization manually is a demanding and cumbersome process. On the other hand automatic methods still have limitations in performance. An optimal approach brings together the power o...
The Adaptive Nature of Human Categorization
A rational model of human categorization behavior is presented that assumes that categorization reflects the derivation of optimal estimates of the probability of unseen features of objects. A Bayesian analysis is performed of what optimal estimations would be if categories formed a disjoint partitioning of the object space and if features were independently displayed within a category. This Ba...
On the Importance of Parameter Tuning in Text Categorization
Text Categorization algorithms have a large number of parameters that determine their behaviour, whose effect is not easily predicted objectively or intuitively and may very well depend on the corpus or on the document representation. Their values are usually taken over from previously published results, which may lead to less than optimal accuracy in experimenting on particular corpora. In thi...
Journal: J. Economic Theory
Volume: 152
Publication date: 2014